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Burkholderia  mallei  is  an  infectious  intracellular  pathogen 
whose  virulence  and  resistance  to  antibiotics  makes  it  a 
potential  bioterrorism  agent.  Given  its  genetic  origin  as  a 
commensal  soil  organism,  it  is  equipped  with  an  extensive 
and  varied  set  of  adapted  mechanisms  to  cope  with  and 
modulate  host-cell  environments.  One  essential  virulence 
mechanism  constitutes  the  specialized  secretion  systems 
that  are  designed  to  penetrate  host-cell  membranes  and 
insert  pathogen  proteins  directly  into  the  host  cell’s  cytosol. 
However,  the  secretion  systems’  proteins  and,  in  particular, 
their  host  targets  are  largely  uncharacterized.  Here,  we 
used  a  combined  in  silico,  in  vitro,  and  in  vivo  approach  to 
identify  B.  mallei  proteins  required  for  pathogenicity.  We 
used  bioinformatics  tools,  including  orthology  detection 
and  ab  initio  predictions  of  secretion  system  proteins,  as 
well  as  published  experimental  Burkholderia  data  to  ini¬ 
tially  select  a  small  number  of  proteins  as  putative  viru¬ 
lence  factors.  We  then  used  yeast  two-hybrid  assays 
against  normalized  whole  human  and  whole  murine  pro- 
teome  libraries  to  detect  and  identify  interactions  among 
each  of  these  bacterial  proteins  and  host  proteins.  Anal¬ 
ysis  of  such  interactions  provided  both  verification  of 
known  virulence  factors  and  identification  of  three  new 
putative  virulence  proteins.  We  successfully  created  in¬ 
sertion  mutants  for  each  of  these  three  proteins  using  the 
virulent  B.  mallei  ATCC  23344  strain.  We  exposed  BALB/c 
mice  to  mutant  strains  and  the  wild-type  strain  in  an  aer¬ 
osol  challenge  model  using  lethal  B.  mallei  doses.  In  each 
set  of  experiments,  mice  exposed  to  mutant  strains  sur¬ 
vived  for  the  21 -day  duration  of  the  experiment,  whereas 
mice  exposed  to  the  wild-type  strain  rapidly  died.  Given 
their  in  vivo  role  in  pathogenicity,  and  based  on  the  yeast 


From  the  ^Department  0f  Defense  Biotechnology  High  Perform¬ 
ance  Computing  Software  Applications  Institute,  Telemedicine  and 
Advanced  Technology  Research  Center,  U.S.  Army  Medical  Research 
and  Materiel  Command,  Fort  Detrick,  Maryland  21702;  §J.  Craig 
Venter  Institute,  Rockville,  Maryland  20850;  ^Bacteriology  Division, 
U.S.  Army  Medical  Research  Institute  of  Infectious  Diseases,  Fort 
Detrick,  Maryland  21702 
“  Author’s  Choice — Final  version  full  access. 

Received  March  8,  2013,  and  in  revised  form,  June  10,  2013 
Published,  MCP  Papers  in  Press,  June  23,  2013,  DOI  10.1074/mcp. 
Ml  13.029041 


two-hybrid  interaction  data,  these  results  point  to  the  im¬ 
portance  of  these  pathogen  proteins  in  modulating  host 
ubiquitination  pathways,  phagosomal  escape,  and  actin-cy- 
toskeleton  rearrangement  processes.  Molecular  &  Cellular 
Proteomics  12: 10.1074! mcp. Ml  13.029041,  3036-3051,  2013. 

Taken  out  of  their  natural  rhizosphere  environment,  many 
members  of  the  genus  Burkholderia  have  proven  adept  in 
surviving  and  adapting  to  many  diverse  environments.  Of 
particular  interest  are  the  two  closely  related  pathogenic 
members:  Burkholderia  mallei,  the  causative  agent  of  glan¬ 
ders,  a  disease  primarily  affecting  horses  but  transmittable  to 
humans;  and  Burkholderia  pseudomallei,  which  is  responsible 
for  melioidosis,  a  human  disease  endemic  to  Southeast  Asia 
and  Northern  Australia.  Human  infection  of  these  opportunis¬ 
tic  pathogens  occurs  through  ingestion,  inhalation,  or  skin 
abrasion.  Given  their  considerable  antibiotic  resistance,  ability 
to  infect  via  aerosol,  and  absence  of  vaccines,  these  patho¬ 
gens  form  both  an  emerging  public  health  threat  in  their 
natural  environment  and  a  potential  bioterrorism  threat  (1 , 2). 
B.  mallei,  the  focus  of  our  study  and  the  less  characterized  of 
the  two  species,  is  considered  a  deletion  clone  of  B.  pseu¬ 
domallei,  with  >1000  genes  lost  in  an  adaptation  to  equine 
hosts.  Thus,  the  genes  retained  in  B.  mallei  share  a  high 
sequence  similarity  to  genes  common  to  B.  pseudomallei  (3), 
and  many  virulence  factors  are  common  for  these  two  species 
(4).  Some  virulence  factors  present  in  B.  pseudomallei  were 
lost  in  B.  mallei’s  adaptation  to  its  obligatory  intracellular 
lifestyle  in  equids.  However,  the  mechanisms  and  capabilities 
required  for  successful  host  colonization  are  retained  in  B. 
mallei,  e.g.  quorum  sensing  (5,  6),  adhesion  (7),  capsular 
polysaccharide  gene  clusters  and  lipopolysaccharides  (8),  ac- 
tin-based  motility  (9,  10),  and  a  number  of  different  secretion 
systems  (11-14).  Secreted  proteins,  known  as  “effector”  pro¬ 
teins  interact  with  and  hijack  critical  human  proteins  and 
pathways,  which  allow  the  pathogen  to  survive  and  propa¬ 
gate  in  the  host  environment.  These  pathogen-induced  in¬ 
teractions  encompass  alteration  of  host  cell  signaling  (15), 
cytoskeletal  modulations  (16),  ubiquitin  modification  (17), 
autophagy  suppression  (4),  and  apoptotic/pyroptotic  con- 
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trol  (18).  An  apparently  distinctive  capability  of  B.  pseu¬ 
domallei  and  B.  mallei  is  their  tendency  to  form  multinucle- 
ated  giant  host-cell  complexes  as  a  means  for  the 
pathogen  to  spread  using  a  direct  cell-to-cell  traversal 
mechanism  (4,  9,  19).  All  of  these  capabilities  are  ultimately 
derived  from  natural  adaptations  and  survival  strategies 
evolved  for  living  in  the  rhizosphere  (20). 

Research  into  the  pathogenicity  of  B.  pseudomallei  and  B. 
mallei  has  focused  mostly  on  the  bacterial  aspect  of  infectiv- 
ity,  host  immune  responses,  and  disease  progression, 
whereas  the  fundamental  molecular  host-pathogen  interac¬ 
tions  that  mediate  these  activities  have  gathered  less  atten¬ 
tion.  Although  the  transcriptional  immune  response  in  the  host 
has  been  investigated  by  microarray  experiments  (21-23), 
most  of  the  proteins  used  by  the  pathogen  to  exert  its  influ¬ 
ence  are  not  known.  A  more  comprehensive  picture  of  how 
the  pathogen  interacts  with  host  cells  requires  novel  ap¬ 
proaches  that  combine  traditional  bacteriology,  high-through- 
put  experiments,  and  computational  methods.  In  this  study, 
we  used  a  combined  computational  and  experimental  ap¬ 
proach  to  systematically  identify  B.  mallei  proteins  and  their 
host  protein  interactions.  In  particular,  we  focused  on  B.  mal¬ 
lei  proteins  that  could  potentially  be  translocated  into  host 
cells  via  specific  secretion  systems.  We  then  screened  these 
bacterial  proteins  in  vitro  using  yeast  two-hybrid  (Y2H)1  as¬ 
says  against  both  human  and  murine  whole  proteomes.  A 
computational  analysis  of  these  interactions  provided  likely 
functional  targets  for  these  proteins.  Accordingly,  we  selected 
three  B.  mallei  genes  previously  not  tied  to  virulence,  sepa¬ 
rately  constructed  insertion  mutants  for  each  of  the  genes, 
and  infected  BALB/c  mice  in  an  aerosol  challenge  model  with 
the  mutants.  Whereas  mice  infected  with  the  wild-type  B. 
mallei  ATCC  23344  strain  rapidly  succumbed  to  the  infection 
and  died,  all  animals  separately  infected  with  each  of  the  three 
mutant  strains  survived  the  aerosol  challenge  for  the  duration 
of  the  experiment. 

These  results  implicated  three  B.  mallei  proteins,  not  pre¬ 
viously  considered  virulence  factors,  to  an  in  vivo  virulence 
phenotype.  Two  of  these  proteins  are  annotated  as  hypothet¬ 
ical  proteins  of  unknown  function,  and  we  used  the  Y2H  data 
to  characterize  their  putative  molecular  functions.  The  appar¬ 
ent  role  of  BMAA0728  relates  to  ubiquitination  and  phago- 
somal  escape  processes,  whereas  BMAA1865  activity  ap¬ 
pears  directed  toward  host  actin-cytoskeleton  rearrangement 
processes.  The  third  protein,  BMAA0553,  a  putative  phospha¬ 
tase  by  sequence  homology,  is  postulated  to  interfere  with 


1  The  abbreviations  used  are:  CFU,  Colony-forming  units;  GO, 
Gene  ontology;  IACUC,  Institutional  Animal  Care  and  Use  Committee; 
IBC,  Institutional  Biosafety  Committee;  KEGG,  Kyoto  encyclopedia  of 
genes  and  genomes;  LB,  Luria-Bertani;  NCBI,  National  Center  for 
Biotechnology  Information;  PPI,  Protein-protein  interaction;  T3SS, 
Type  3  secretion  system;  T6SS,  Type  6  secretion  system;  USAMRIID, 
United  States  Army  Medical  Research  Institute  for  Infectious  Dis¬ 
eases;  Y2H,  Yeast  two-hybrid. 


host  signaling  pathways  and  actin  cytoskeleton  rearrange¬ 
ment.  Based  on  our  current  understanding  of  Burkholderia 
spp.  host  infections,  these  findings  shed  complementary 
mechanistic  light  on  B.  mallei  pathogenicity  as  it  pertains  to 
host  cell  invasion  and  survival.  They  also  raise  the  possibility 
that  B.  mallei  virulence  factors  are  involved  in  more  than  one 
stage  of  the  bacterial  intracellular  life  cycle.  These  hypotheses 
and  observations  lay  the  foundation  for  future  biochemical 
and  immunological  work. 

EXPERIMENTAL  PROCEDURES 

Comparative  Genomics— We  used  comparative  genomics  to  1) 
identify  genes  that  discriminate  pathogenic  and  nonpathogenic 
strains  of  B.  mallei  and  2)  search  for  proteins  using  orthology  with 
known  virulence  factors.  In  these  computations,  we  used  a  recently 
published  orthology  detection  method  developed  by  our  group, 
termed  Quartets  (24,  25),  which  exploits  evolutionary  evidence  to 
predict  evolutionary  related  genes  in  a  computationally  efficient  man¬ 
ner,  enabling  whole  genome  comparisons. 

Functional  annotations  of  genomes  were  taken  from  GenBank  and 
Pathema  (26,  27).  For  hypothetical  or  unannotated  proteins,  we  gen¬ 
erated  function  hypotheses  using  the  algorithms  implemented  in  our 
integrated  protein  function  annotation  software,  PIPA  (28),  which 
combines  multiple  programs  and  databases  to  provide  a  unified 
terminology  and  consensus  function  annotation. 

Ab  Initio  Prediction  of  Type  3  Secretion  System  Proteins— As  se¬ 
cretion  system  proteins  frequently  diverged  beyond  recognition  by 
orthology,  specialized  methods  are  required  for  their  prediction.  Re¬ 
cently,  several  groups  have  described  computational  methods  to 
identify  type  3  secretion  system  (T3SS)  proteins  based  on  protein 
sequence  information  (29-32),  whereas  algorithms  for  the  prediction 
of  type  6  secretion  system  (T6SS)  proteins  are  not  available.  These  ab 
initio  methods  for  the  prediction  of  T3SS  proteins  use  machine  learn¬ 
ing  techniques  to  train  predictive  models  on  sets  of  known  secreted 
proteins  in  a  range  of  bacteria.  In  this  work,  we  used  two  such 
algorithms  and  reported  their  consensus  predictions.  The  first 
method,  EffectiveT3  (29),  uses  amino  acid  composition  and  di-  and 
tri-peptide  pattern  frequency  as  inputs  to  a  number  of  different  ma¬ 
chine  learning  algorithms,  whereas  the  second  method,  developed  by 
Lower  and  Schneider  (30),  uses  a  sliding  window  of  protein  sequence 
as  input  to  neural  networks  and  support  vector  machine  algorithms. 
For  the  EffectiveT3  software  (29),  we  used  a  probability  cutoff  of  0.995 
for  classification  as  a  T3SS  protein,  whereas  for  the  method  de¬ 
scribed  in  Ref.  (30)  we  used  a  0.6  cutoff. 

Y2H  Screens  to  Identify  Host-Pathogen  Protein-Protein  Interac¬ 
tions  (PPIs)-B.  mallei  genes  were  cloned  from  B.  mallei  ATCC  23344 
genomic  DNA  using  the  Gateway™  Entry  system,  pDONR™/Zeo 
(Invitrogen,  Carlsbad,  CA).  In  addition  to  clones  of  intact  genes,  11 
putative  virulent  factors  were  cloned  in  more  than  one  truncated  form 
(domain  fragments).  The  oligonucleotides  of  gene-specific  regions 
for  polymerase  chain  reaction  (PCR)  amplification  were  designed 
using  the  “OligoCalc:  Oligonucleotide  Properties  Calculator”  Web 
tool  (http://www.basic.northwestern.edu/biotools/oligocalc.html). 
To  ensure  the  specificity  of  the  PCRs,  the  gene-specific  regions  were 
designed  at  their  salt-adjusted  melting  temperature  near  60  °C.  PCR 
primers  incorporated  the  following  forward  and  reverse  recombina¬ 
tion  cloning  sequences:  attBt  (5'-GGGGACAAGTTTGTACAAAAAAG- 
CAGGCTTC-3')  and  attB2  (5'-GGGGACCACTTTGTACAAGAAAGCTG- 
GGTC-3'),  respectively.  The  oligonucleotides  were  purchased  from 
Invitrogen.  The  selected  open  reading  frame  (ORF)  sequences  were 
amplified  by  PCR  in  a  96-well  format.  The  PCR  amplification  was 
performed  in  a  volume  of  50  /xl  containing  2  ng  of  genomic  DNA,  200 
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nM  of  each  primer,  0.8  rriM  dNTPs,  and  Phusion  DNA  polymerase 
(New  England  Biolabs,  Ipswich,  MA).  After  denaturation  at  98  °C  for  1 
min,  the  incubation  conditions  were  performed  as  follows:  25  cycles 
of  30  s  at  98  °C,  30  s  at  55  °C,  1  min  at  72  °C,  and  a  final  10  min 
incubation  at  72  °C. 

The  amplified  ORFs  were  cloned  into  pDONR™/Zeo  by  recombi¬ 
nation  reactions  using  the  BP  Clonase™  II  Enzyme  Mix  (Invitrogen). 
The  reactions  were  performed  with  150  ng  of  amplified  inserts  and 
150  ng  of  pDONR™/Zeo  inTE  buffer  (pH  8.0)  and  incubated  overnight 
at  25  °C.  The  reaction  mixtures  were  incubated  with  1  ^g  proteinase 
K  for  30  min  at  37  °C.  Two  microliters  of  the  reaction  mixtures  were 
transformed  into  chemically  competent  DH5<*.  Transformants  were 
plated  on  Luria-Bertani  (LB)  agar-containing  zeocin  at  a  final  concen¬ 
tration  of  30  /j.g/ml  and  incubated  overnight.  Selected  transformants 
were  grown  in  LB  medium  with  zeocin  to  prepare  frozen  cultures  and 
for  plasmid  preparation.  The  cloned  plasmids  were  validated  by 
Sanger  DNA  sequencing. 

Target  genes  in  entry  clones  were  subcloned  into  NH2-terminal  and 
COOH-terminal  DNA-binding  domain  (bait)  Y2H  vectors,  pGBGT7g 
and  pGBKCg  (33),  using  Gateway  LR  reactions  (34).  LR  reactions 
were  performed  using  equal  quantities  of  the  entry  clone  plasmid  and 
the  destination  vector  of  35  fmol  in  a  10  /nl  volume  and  2  /nl  (1  x  unit) 
LR  clonase  II  enzyme  mix.  Reactions  were  incubated  for  16  h  at  37  °C 
and  terminated  by  an  incubation  with  2  p,g  proteinase  K  for  10  min  at 
37  °C.  The  destination  clones  were  transformed  into  DH5a  and  plated 
on  LB  agar  containing  30  p,g/ml  kanamycin.  After  bacterial  transfor¬ 
mation,  miniprep  plasmid  DNAs  of  all  the  bait  clones  were  trans¬ 
formed  into  Y2H  yeast  strain  AH109  ( MAT-a )  (35),  as  previously 
described  in  Ref.  (36). 

Before  the  two-hybrid  analyses,  all  bait  yeast  strains  were  exam¬ 
ined  for  auto-activation,  i.e.  detectable  bait-dependent  reporter  gene 
activation  in  the  absence  of  any  prey  interaction  partner.  Weak  to 
intermediate  strength  auto-activator  baits  can  be  used  in  two-hybrid 
array  screens  because  the  corresponding  bait-prey  interactions  con¬ 
fer  stronger  signals  than  the  auto-activation  background.  We  used  the 
HIS3  reporter  gene  and  suppressed  auto-activation  by  adding  differ¬ 
ent  concentrations  of  3-amino-1 ,2,4-triazole  (3-AT),  a  competitive 
inhibitor  of  HIS3.  All  of  the  B.  mallei  baits  were  inspected  for  auto¬ 
activation  on  plates  containing  different  concentrations  of  3-AT.  The 
lowest  concentration  of  3-AT  that  suppressed  growth  for  the  interac¬ 
tion  screen  was  used  (see  below),  because  it  avoided  background 
growth  while  still  detecting  true  interactions. 

A  haploid  yeast  strain  expressing  a  single  B.  mallei  protein  as  bait 
was  used  to  separately  perform  protein  interaction  screens  with  hu¬ 
man  and  mouse  normalized  universal  two-hybrid  cDNA  libraries  (cat¬ 
alog  nos.  630480  and  630482,  Clontech).  The  bait  and  prey  (cDNA 
library)  yeast  culture  was  grown  and  mixed  (at  a  1 :1  ratio)  and  plated 
on  YEPDA  agar  plates  (as  recommended  by  the  cDNA  library  pro¬ 
vider,  Clontech).  YEPDA  agar  plates  were  incubated  at  30  °C  for  6  h 
or  overnight  at  room  temperature.  During  this  process,  both  prey  and 
bait  plasmids  were  combined  in  the  diploid  yeast  cells  by  yeast 
matting.  The  cells  from  the  mating  plates  were  collected  and  trans¬ 
ferred  onto  interaction  selection  yeast  synthetic  medium  with  pre¬ 
defined  concentrations  of  3-AT  (media  lacking  tryptophan,  leucine, 
and  histidine  plus  3-AT),  and  plates  were  incubated  at  30  °C  for  4-6 
days.  The  interaction  selection  plates  that  showed  colony  growth  but 
no  colonies  on  control  plates  (bait  mated  to  empty  prey  vector)  were 
identified  as  the  two-hybrid  positive  yeast  clones.  Positive  yeast 
colonies  were  selected  either  manually  or  using  robotics  and  sub¬ 
jected  to  yeast  colony  PCR  followed  by  DNA  sequencing  to  identify 
the  interacting  preys  (36). 

Mapping  of  Host  Proteins  to  Pathways  and  Networks— All  compu¬ 
tational  analyses  were  performed  in  R,  using  Bioconductor/BioMart 
(37,  38)  for  GO  and  KEGG  analyses  and  the  “survival”  package  for 


survival  analysis  (Terry  Therneau,  “A  Package  for  Survival  Analysis  in 
S,”  R  package  version  2.36-14,  2012).  All  networks  were  plotted 
using  Cytoscape  (39). 

Protein-Protein  Interaction  Data— To  identify  conserved  PPIs  be¬ 
tween  the  B.  mallei- human  and  B.  mallei- murine  PPI  data  sets,  we 
first  identified  human-murine  orthologs  of  all  human  and  murine 
proteins  interacting  with  B.  mallei ,  using  the  National  Center  for 
Biotechnology  Information  (NCBI)  HomoloGene  database  of  ho¬ 
mologs  (http://www.ncbi.nlm.nih.gov/homologene)  (40,  41).  Then, 
we  denoted  host-pathogen  interactions  in  which  human  proteins 
interacted  with  the  same  B.  mallei  proteins  as  their  murine  orthologs 
as  conserved  interactions.  For  example,  the  interaction  between  B. 
mallei  protein  BMAA0553  and  human  protein  NPC2  was  conserved 
because  BMAA0553  also  interacted  with  murine  protein  Npc2,  and 
NPC2  and  Npc2  are  defined  as  human-murine  orthologs  in  the  NCBI 
HomoloGene  database. 

To  evaluate  the  influence  of  the  pathogen  on  host  proteins,  includ¬ 
ing  the  relationships  among  host  proteins  targeted  by  the  pathogen's 
proteins,  we  mapped  the  identified  host  proteins  interacting  with  B. 
mallei  proteins  onto  a  human  PPI  network  (42).  The  human  PPI 
network  consists  of  81 ,226  physical  PPIs  among  13,382  human  pro¬ 
teins.  Of  the  170  human  proteins  found  to  interact  with  BMAA0728, 
BMAA1865,  or  BMAA0553,  125  (—74%)  were  found  in  the  PPI  net¬ 
work.  Currently  available  murine  PPIs  data  are  largely  incomplete  and 
sparse,  so  we  decided  not  to  map  the  murine  proteins  onto  a  murine 
PPI  network.  Instead,  we  used  the  NCBI  HomoloGene  database  to 
identify  human  orthologs  of  murine  proteins  that  interacted  with  B. 
mallei  proteins  and  mapped  the  identified  orthologs  onto  the  human 
PPI  network.  Of  the  191  murine  proteins  interacting  with  BMAA0728, 
BMAA1865,  or  BMAA0553,  166  (87%)  had  human  orthologs  listed  in 
HomoloGene.  Of  the  336  human  proteins  (170  human  proteins  +  166 
murine  orthologs)  interacting  with  three  B.  mallei  proteins  of  interest, 
244  (—73%)  were  found  in  the  human  PPI  network. 

To  assign  protein-domain  annotation  to  B.  mallei ,  human,  and 
murine  proteins,  we  used  the  high-confidence  annotations  from 
PFAM-A  database  (43). 

Gene  Ontology  (GO)  Enrichment  Analysis— We  used  the  hypergeo¬ 
metric  distribution  to  assess  the  statistical  significance  of  observing  a 
given  GO  term  enrichment  in  our  data.  For  a  given  host  (human  or 
murine),  we  first  counted  the  total  number  of  proteins  ( N)  annotated 
with  any  GO  term.  Then,  for  each  GO  term,  g,  we  counted  the  total 
number  of  proteins  (m)  annotated  with  g.  Next,  we  calculated  the 
probability  (p)  of  observing  the  same,  or  higher,  enrichment  in  term  g 
by  chance,  as  follows: 


P  =  1 


k  - 1 

2 


i  =  0 


(m\(N-m\ 
\k  )\  n  -  k  ) 


(Eq.  1) 


where  n  represents  the  number  of  host  proteins  interacting  with  B. 
mallei  proteins  annotated  with  any  GO  term  and  k  denotes  the  number 
of  host  proteins  interacting  with  B.  mallei  proteins  annotated  with  g. 
For  the  total  number  of  human  proteins,  we  used  the  set  of  all  human 
and  murine  proteins  available  in  Bioconductor  and  BioMart  (37,  38) 
and  annotated  with  at  least  one  GO  term,  excluding  root  GO  terms. 

In  this  analysis,  we  only  considered  GO  annotations  that  were 
enriched  at  a  false  discovery  rate  control  level  of  0.2  or  below,  after 
accounting  for  multiple  hypothesis  testing  using  the  Benjamini  and 
Hochberg  multiple  test  correction  (44). 

Kyoto  Encyclopedia  of  Genes  and  Genomes  (KEGG)  Pathways 
Enrichment  Analysis— KEGG  pathways  are  not  independent  of  each 
other,  and  a  large  number  of  pathways  share  a  statistically  significant 
number  of  genes  (45).  Thus,  analyses  that  search  for  statistically 
significantly  enriched  pathways  may  identify  multiple  pathways  as 
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significant  simply  because  they  contain  a  large  number  of  shared 
genes  (e.g.  the  long-term  depression  pathway  shares  —60%  of  its 
genes  with  the  vascular  smooth  muscle  contraction  pathway)  and  not 
necessarily  because  of  shared  functionality.  As  these  nonpathway- 
specific  genes  are  not  necessarily  a  good  representation  of  a  specific 
biological  process,  we  focused  on  the  identified  pathways  that  are 
enriched  with  genes  that  mainly  belong  to  that  pathway  or  a  very 
small  number  of  other  pathways.  Thus,  to  identify  gene-specific 
KEGG  pathways,  we  used  the  hypergeometric  distribution  with  ad¬ 
justed  parameters  to  account  for  the  specificity  of  each  gene.  For  a 
given  host,  for  each  gene  /  in  KEGG,  we  counted  the  number  of 
pathways  in  which  the  gene  appears  and  denoted  this  number  as  kk 
We  then  defined  the  probability  of  each  gene  /  being  specific  (p,)  as 
follows: 

Pi  =  l  (Eq-2) 

Next,  we  defined  the  adjusted  population  size  of  all  host  proteins  in 
KEGG  ( An )  as  follows: 

Nkegg 

Av=  'ZPi  (Eq.  3) 

i 

where  the  summation  runs  over  all  NKEGG  host  proteins  that  appear  in 
at  least  one  KEGG  pathway.  We  then  defined  the  adjusted  number  of 
genes  in  a  pathway  P  (Am)  as: 

np 

Am='Zpj  (Eq.  4) 

i 

where  the  summation  runs  over  all  nP  host  proteins  that  appear  in 
pathway  P.  The  adjusted  total  number  of  interacting  host  proteins 
found  in  KEGG  pathways  (An)  was  defined  as: 

4,=  Eft  (Eq.  5) 

i 

where  the  summation  runs  over  all  nH  host  proteins  that  interact  with 
any  8.  mallei  protein  and  appears  in  any  pathway.  Finally,  the  ad¬ 
justed  number  of  host  proteins  interacting  with  B.  mallei  proteins  in 
pathway  P  (Ak)  was  defined  as: 

PHP 

\='Zpi  (Eq.  6) 

i 

where  the  summation  runs  over  all  nHP  interacting  host  proteins  in 
pathway  P. 

Because  the  adjusted  numbers  AN,  Am,  An,  and  Ak  are  real  num¬ 
bers,  we  converted  each  of  them  to  its  next  smallest  integer,  N ,  m,  n, 
and  k,  respectively,  and  calculated  the  probability  of  observing  the 
same  or  higher  specific  enrichment  in  pathway  P  purely  by  chance 
using  Eq.  1 .  For  the  total  set  of  human  proteins,  we  used  all  unique 
human  proteins  that  belong  to  at  least  one  KEGG  pathway. 

Bacterial  Strains,  Plasmids,  and  Growth  Conditions— Table  I  shows 
the  plasmids  and  bacterial  strains  used  in  this  study.  E.  coli  was 
grown  at  37  °C  on  LB  (Lennox)  agar  or  in  LB  broth.  B.  mallei  was 
grown  at  37  °C  on  LB  agar  with  4%  glycerol  or  in  LB  broth  with  4% 
glycerol.  All  bacterial  strains  were  grown  in  broth  for  — 18  h  with 
constant  agitation  at  250  revolutions/min.  When  appropriate,  antibi¬ 
otics  were  added  at  the  following  concentrations:  25  p.g  kanamycin 
per  milliliter  for  E.  coli  and  15  pg  kanamycin  and  25  p,g  polymyxin  B 
per  milliliter  for  B.  mallei.  Phosphate-buffered  saline  (PBS)  was  used 


to  make  serial  dilutions  of  saturated  bacterial  cultures,  and  the  num¬ 
ber  of  cfu  present  in  the  starting  culture  was  determined  by  spreading 
100  pi  aliquots  onto  agar  media  and  incubating  for  24-48  h.  A  20 
mg/ml  stock  solution  of  the  chromogenic  indicator  5-bromo-4-chloro- 
3-indolyl-b-D-galactoside  was  prepared  in  A/,A/-dimethylformamide, 
and  40  pi  were  spread  onto  the  surface  of  the  plate  medium  for 
blue/white  screening  in  E.  coli  TOPI  0.  All  manipulations  with  8.  mallei 
were  carried  out  in  a  class  II  microbiological  safety  cabinet  located  in 
a  designated  biosafety  level  3  laboratory. 

PCR  of  Internal  Gene  Fragments— Table  I  also  shows  the  PCR 
primers  used  to  amplify  internal  gene  fragments  of  BMAA0728, 
BMAA1865,  and  BMAA0553.  PCR  amplifications  were  performed  in  a 
final  reaction  volume  of  100  pi  containing  lx  TaqPCR  Master  Mix 
(Qiagen),  1  pM  oligodeoxyribonucleotide  primers,  and  —200  ng  of  8. 
mallei  ATCC  23344  genomic  DNA.  Bacterial  genomic  DNA  was  pre¬ 
pared  using  a  previously  described  protocol  (49).  PCR  cycling  was 
performed  using  a  PTC-150  MiniCycler  with  a  Hot  Bonnet  accessory 
(MJ  Research,  Inc.)  and  heated  to  97  °C  for  5  min.  This  was  followed 
by  30  cycles  of  a  three-temperature  cycling  protocol  (97  °C  for  30  s, 
55  °C  for  30  s,  and  72  °C  for  1  min)  and  1  cycle  at  72  °C  for  10  min. 
PCR  products  were  sized  and  isolated  using  agarose  gel  electropho¬ 
resis,  cloned  using  the  pCR2.1  -TOPO  TA  Cloning  Kit  (Invitrogen),  and 
transformed  into  chemically  competent  E.  coli  TOP10. 

DNA  Manipulation  and  Plasmid  Conjugation— The  restriction  en¬ 
zymes  EcoRI  and  A/ofl,  Antarctic  phosphatase,  and  T4  DNA  ligase 
were  purchased  from  Roche  Molecular  Biochemicals  and  used  ac¬ 
cording  to  the  manufacturer’s  instructions.  DNA  fragments  used  in 
cloning  procedures  were  excised  from  agarose  gels  and  purified  with 
a  GeneClean  III  kit  (Q  BlOgene).  Plasmids  were  purified  from  over¬ 
night  cultures  using  Wizard  Plus  SV  Minipreps  (Promega).  Plasmids 
pEXKm5-0728,  pEXKm5-1865,  and  pEXKm5-0553  (Table  I)  were 
electroporated  into  E.  coli  SI  7-1  (12.25  kV/cm)  and  conjugated  with 
8.  mallei  ATCC  23344  for  8  h,  as  previously  described  (8).  Polymyxin 
B  was  used  to  counter-select  E.  coli  SI  7-1  donor  strains. 

Animal  Experiments— Female  BALB/c  mice  were  obtained  from 
Charles  River  Laboratories  (National  Cancer  Institute)  and  were  6-8 
week  old  at  the  time  of  use.  Animals  were  provided  with  rodent  feed 
and  water  ad  libitum  and  maintained  on  a  12:12-hour  light-dark  cycle. 
Whole  body  exposures  to  aerosols  of  8.  mallei  were  performed  as 
previously  described  by  Roy  et  at.  (50).  Research  was  conducted  in 
compliance  with  the  Animal  Welfare  Act  and  other  federal  statutes 
and  regulations  relating  to  animals  and  experiments  involving  animals 
and  adhered  to  principles  stated  in  the  National  Institutes  of  Health 
Guide  for  the  Care  and  Use  of  Laboratory  Animals  (http://www. 
nap.edu/readingroom/books/labrats/chaps.html).  The  United  States 
Army  Medical  Research  Institute  for  Infectious  Diseases  (USAMRIID) 
Institutional  Animal  Care  and  Use  Committee  (IACUC)  reviewed  and 
approved  the  animal  protocol  entitled  “Evaluation  of  Burkholderia 
mallei  Secretion  Inhibitors  and  Identification  of  8.  mallei  and  8.  pseu¬ 
domallei  Virulence  Determinants  in  Mice  and  Hamsters.”  The  animal 
protocol  number  assigned  to  this  protocol  by  the  USAMRIID  IACUC 
was  AP-09-045.  The  facility  where  this  research  was  conducted  is 
fully  accredited  by  the  Association  for  Assessment  and  Accreditation 
of  Laboratory  Animal  Care  International. 

Survival  Analysis— We  used  the  Kaplan-Meier  method  (51)  to  gen¬ 
erate  survival  curves  from  the  number  of  surviving  mice  as  a  function 
of  time.  To  this  end,  Kaplan-Meier  survival  curves  can  be  interpreted 
as  the  probability  of  animals  surviving  a  given  length  of  time.  This 
probability  is  defined  as  the  cumulative  probability  of  surviving  k  time 
periods  [S(/c)j,  as  follows: 

S(k)  =  p,  ■  p2  ■  p3 - pk  (Eq.  7) 

where  p,  represents  the  proportion  of  animals  surviving  period  i.  Here, 
Pi  was  defined  as  follows: 
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Table  I 

Primers,  plasmids,  and  strains  used  in  the  mouse  aerosol  challenge  model  experiment.  Kmr,  kanamycin  resistant.  Kms,  kanamycin  sensitive. 

Pmr,  polymyxin  B  resistant.  Pms,  polymyxin  B  sensitive 


Primer,  plasmid, 
or  strain 

Relevant  characteristics 

Source  or 
reference 

Primers 

0728-up 

5'-GCGGCCGCACACATCCGGCACGCAACAG-3' 

This  study 

0728-dn 

5 ' -GCGGCCGCTCTCCAAAAGAATCCACAAC-3 ' 

This  study 

1865-up 

5 '  -GAATTCATCG  ACGAAATTCCCCGA-3 ' 

This  study 

1865-dn 

5 '  -GAATT  CGAGCTTGGCGAT  CT  CGAC-3 ' 

This  study 

0553-up 

5 '  -GAATTCG  AAGCCGTCGAGCTTTC-3 ' 

This  study 

0553-dn 

5 '  -GAATTCCG  AATAGTTCGCGCTGC-3 ' 

This  study 

Plasmids 

pCR2.1  -TOPO 

3,931 -bp  TA  vector;  pMBI  oriR;  Kmr 

Invitrogen 

pCR2. 1-0728 

pCR2.1-TOPO  containing  298-bp  PCR  product  generated  with  0728-up 
and  0728-dn 

This  study 

pCR2.1-1865 

pCR2.1-TOPO  containing  297-bp  PCR  product  generated  with  1865-up 
and  1865-dn 

This  study 

pCR2. 1-0553 

pCR2.1-TOPO  containing  535-bp  PCR  product  generated  with  0553-up 
and  0553-dn 

This  study 

pEXKm5 

Mobilizabile  Kmr  suicide  vector 

(46) 

pEXKm5-0728 

pEXKm5  derivative  containing  Not\  insert  from  pCR2. 1-0728 

This  study 

pEXKm5-1 865 

pEXKm5  derivative  containing  EcoRI  insert  from  pCR2.1-1865 

This  study 

pEXKm5-0553 

pEXKm5  derivative  containing  EcoRI  insert  from  pCR2. 1-0553 

This  study 

E.  coii 

TOPIO 

General  cloning  and  blue/white  screening 

Invitrogen 

SI  7-1 

Mobilizing  strain  with  transfer  genes  of  RP4  integrated  on  chromosome; 

Kms,  Pms 

(47) 

B.  mallei 

ATCC  23344 

Type  strain,  isolated  in  1944  from  a  human  case  of  glanders;  Kms,  Pmr 

(48) 

DDA0728 

ATCC  23344  derivative  with  pEXKm5-0728  integrated  into  the 
chromosome;  BMAA0728~ 

This  study 

DDA1865 

ATCC  23344  derivative  with  pEXKm5-1865  integrated  into  the 
chromosome;  BMAA1865~ 

This  study 

DDA0553 

ATCC  23344  derivative  with  pEXKm5-0553  integrated  into  the 
chromosome;  BMAA0553~ 

This  study 

n  -  d, 

pi=—  (Eci-8) 

where  r,  and  d,  denote  the  number  of  alive  mice  at  the  beginning  of 
time  period  i  and  the  number  of  deaths  within  period  /,  respectively. 

To  compare  murine  survival  curves,  we  used  the  following  log-rank 
test: 


this  work  is  composed  of  members  of  the  USAMRIID  research  staff, 
Commander’s  office,  and  qualified  representatives  from  external  in¬ 
stitutions  and  is  tasked  to  provide  local,  institutional  oversight  of 
research  utilizing  recombinant  DNA.  The  USAMRIID  IBC  was  estab¬ 
lished  under  the  United  States  (U.S.)  National  Institutes  of  Health 
Guidelines  for  Research  Involving  Recombinant  DNA  Molecules. 


(°.-M  (*-$■)' 

^(logRank)  = - 1 - (Eq.  9) 

k  ,  k  i 

V,C/,  sr^di 

ljrv  2j-ra 

i  =  i  1  i-i  ' 

where  1  and  2  represent  the  set  of  mice  infected  with  B.  mallei  mutant 
(set  1)  and  the  set  of  mice  infected  with  wild-type  B.  mallei  (set  2),  rl: 
and  r2i  represent  the  number  of  alive  mice  from  sets  1  and  2  at  time 
period  /,  respectively,  and  O,  and  02  represent  the  total  number  of 
observed  events  in  each  of  the  two  sets,  respectively. 

Biosafety  and  Biosecurity—  We  do  not  anticipate  that  this  manu¬ 
script  provides  knowledge,  products,  or  technologies  that  could  be 
directly  misapplied  by  others  to  pose  a  threat  to  public  health  and 
safety,  agricultural  crops  and  other  plants,  animals,  the  environment, 
or  materiel.  The  Institutional  Biosafety  Committee  (IBC)  that  approved 


RESULTS 

Figure  1  outlines  the  components  and  steps  of  our  com¬ 
bined  in  silico,  in  vitro,  and  in  vivo  effort  to  identify  B.  mallei 
virulence  factors  via  a  systematic  analysis  of  host-pathogen 
protein  interactions.  In  the  first  step,  we  used  a  set  of  bioin¬ 
formatics  tools  for  in  silico  identification  of  putative  virulence 
factors,  including  orthology  detection  based  on  pathogen 
proteins  experimentally  linked  to  pathogenicity  in  other  spe¬ 
cies  and  ab  initio  prediction  methods.  We  complemented 
these  methods  with  literature  review  of  published  experimen¬ 
tal  data,  resulting  in  a  pruned  list  of  49  proteins.  In  the  second 
step,  we  screened  the  selected  proteins  in  vitro  for  potential 
interaction  partners,  using  Y2H  assays  against  normalized 
human  and  murine  whole  proteome  libraries,  and  identified 
host-pathogen  protein-protein  interactions  (PPIs).  In  the  third 
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Bioinformatics  strategies  to  identify 
B.  mallei  proteins  involved  in  virulence 


P.  aeruginosa 


In  vivo  validation  using  animal  models 

Construction  of  B.  mallei  insertion  mutants  for  3  genes 
and  their  testing  in  mouse  aerosol  challenge  model 


BALB/c  mice 
N  =  10  each 


BMAA0728 


,  * 

[  B.  mallei  \ 

<  i  f  B.  mallei  1 

I 


BMAA1865 


BMAA0553 


3  proteins 


Yeast  two-hybrid  host-pathogen  protein 
interaction  screens 


Pathogen  proteins 


Human  library 


•  •  •  •  „ 

12  3  ...  7.5-10 

Murine  library 

•  •  •  •  6 

12  3  ...  2.8-10 

]  Host-pathogen 

protein  interactions 


Selection  of  proteins  for  animal  experimentation 

Analysis  of  conserved  interactions  and  prevalent  characteristics 
of  host  proteins  interacting  with  selected  pathogen  proteins 


KEGG  pathways 


Integrate  findings  and  delineate  mechanisms  of  pathogenicity 


Fig.  1 .  The  combined  in  silico,  in  vitro,  and  in  vivo  approach.  We  used  bioinformatics  tools  to  identify  49  proteins  putatively  involved  in 
virulence.  These  proteins  were  then  screened  for  interaction  partners  using  yeast  two-hybrid  (Y2H)  assays  against  whole  human  and  whole 
murine  proteome  libraries.  We  mapped  the  identified  host-pathogen  protein-protein  interactions  (PPIs)  to  host  pathways  and  host  PPI 
networks.  The  host  proteins  interacting  with  Burkholderia  mallei  proteins  were  characterized  with  respect  to  their  functions  and  possible  role 
in  pathogenicity.  This  analysis  resulted  in  the  identification  of  a  three  promising  virulence  factor  candidates.  We  verified  that  mutants  lacking 
these  proteins  showed  attenuated  virulence  in  a  mouse  aerosol  challenge  model.  Finally,  we  used  all  obtained  data  to  delineate  mechanisms 
of  pathogenicity  and  generated  hypotheses  about  the  potential  roles  of  these  three  proteins  in  B.  mallei  invasion  of  the  host  cell. 


step,  we  mapped  interactions  conserved  in  both  hosts  to 
canonical  pathways  and  human  PPI  networks  and  proposed 
three  B.  mallei  proteins  (encoded  by  BMAA0728,  BMAA1865, 
and  BMAA0553),  as  likely  to  interfere  with  host  processes  and 
pathways  associated  with  bacterial  pathogenicity.  In  the 
fourth  step,  we  constructed  B.  mallei  insertion  mutant  strains 
for  each  of  the  three  genes  and  verified  their  biological  rele¬ 
vance  in  vivo  using  a  BALB/c  murine  aerosol  infection  model, 
where  we  ascertained  and  compared  virulence  attenuation  of 
the  mutant  strains  with  the  B.  mallei  ATCC  23344  wild-type 
strain.  The  results  of  these  studies  are  detailed  below. 

In  Silico  Identification  of  Putative  Virulent  Factors— To  cre¬ 
ate  an  initial  set  of  possible  virulence  factors,  we  deployed  three 
independent  bioinformatics  analyses  based  on  the  identification 
of:  1)  genes  present  in  pathogenic  and  absent  in  nonpathogenic 
strains  of  B.  mallei,  2)  orthologous  genes  that  have  been  asso¬ 
ciated  with  secretion  systems  in  other  bacteria,  and  3)  type  3 
secretion  system  (T3SS)-associated  secreted  proteins,  using 
amino  acid  pattern-based  ab  initio  methods.  This  extensive  set 


of  proteins  was  then  examined  within  the  context  of  literature 
searches  and  published  microarray  experimental  data  to  weigh, 
rank,  and  group  the  collected  evidence  into  a  final,  pruned  set  of 
in  silico  predicted  virulence  factors. 

In  the  first  analysis,  we  collected  genomic  data  for  B.  mallei 
and  related  bacterial  genomes  using  Pathema  (27)  for  finished 
and  draft  B.  mallei  genomes  and  GenBank  (26)  for  all  other 
finished  genomes.  We  used  Quartets  (25)  to  identify  ortholo¬ 
gous  proteins  in  the  seven  strains  of  B.  mallei  reported  by 
Losada  et  al.  (52).  Table  II  shows  the  four  pathogenic  and 
three  nonpathogenic/attenuated  strains  used  in  this  study. 
We  identified  orthologous  pairs  of  genes  between  each  two 
strains  and  identified  78  genes  that  were  either  present  in  all 
pathogenic  strains  or  overrepresented  in  the  pathogenic 
strains.  Thirty-two  of  these  orthologs  were  present  in  our  B. 
mallei  ATCC  23344  target  strain  and  had  no  orthologs  among 
the  nonpathogenic  strains.  Even  though  this  set  should  be 
enriched  with  proteins  associated  with  pathogenicity,  the  bulk 
of  these  32  genes  were  annotated  with  hypothetical  func- 
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Table  II 

Strains  of  Burkholderia  mallei  used  to  identify  proteins  overrepresented  in  the  pathogenic  strains 


Genome 

Size,  Mb 

PATRIC  coding  sequences3 

RefSeq  coding  sequences* 

Finished 

Pathogenic 

B.  mallei  ATCC  23344 

5.83 

5724 

5024 

Yes 

Yes 

B.  mallei  NCTC  10229 

5.74 

5591 

5510 

Yes 

Yes 

B.  mallei  ATCC  10399 

5.88 

5850 

5276 

No 

Yes 

B.  mallei  PRL-20 

5.62 

5614 

5456 

No 

Yes 

B.  mallei  SAVP1 

5.23 

5095 

5189 

Yes 

No 

B.  mallei  2002721280 

5.51 

5590 

4847 

No 

No 

B.  mallei  NCTC  10247 

5.84 

5717 

5852 

Yes 

Attenuated 

3  This  column  shows  the  number  of  coding  sequences  for  each  of  the  genomes  that  are  available  in  the  PATRIC  database  (53). 
b  This  column  shows  the  number  of  coding  sequences  available  in  the  National  Center  for  Biotechnology  Information  Reference  Sequence 


database  (54). 


tions  or  with  functions  that  could  not  readily  be  associated 
with  pathogenicity  (supplemental  Table  SI)  and,  hence, 
their  actual  role  in  Burkholderia  pathogenicity  remains  to  be 
determined. 

In  the  second  analysis,  we  examined  gene  clusters  associ¬ 
ated  with  bacterial  secretion  systems  (55).  In  particular,  pre¬ 
vious  experimental  work  in  animal  models  has  indicated  that 
the  T3SS  (14)  and  the  type  6  secretion  system  (T6SS)  (11,  13) 
are  involved  in  virulence  of  B.  mallei  ATCC  23344.  Recent 
publications  have  also  elucidated  the  role  of  T3SS  (56)  and 
T6SS  (11)  in  the  virulence  of  B.  pseudomallei.  We  used  avail¬ 
able  literature  and  database  information  to  compile  a  list  of 
known  and  predicted  proteins  associated  with  bacterial  se¬ 
cretion.  Then,  we  used  Quartets  to  identify  24  orthologs  of 
these  proteins  present  in  the  B.  mallei  ATCC  23344  strain, 
supplemental  Table  S2  gives  the  gene  locus,  name,  secretion 
system,  and  evidence  used  to  identify  this  set  of  proteins. 

Finally,  in  the  third  analysis,  we  used  two  T3SS  protein 
prediction  methods  to  identify  B.  mallei  proteins  that  share 
characteristics  with  known  T3SS  proteins  and,  hence,  repre¬ 
sent  potential  virulence  factors  (29,  30).  We  selected  proteins 
that  were  common  to  both  prediction  methods  and  located  on 
the  second  chromosome,  as  the  known  6.  mallei  T3SS  is 
located  on  this  chromosome.  This  analysis  identified  37  pro¬ 
teins  (supplemental  Table  S3). 

To  select  proteins  for  experimental  evaluation,  we  first 
merged  the  lists  obtained  in  three  independent  analyses 
above,  and  revised  the  merged  list  using  published  microarray 
experimental  data.  In  particular,  we  compiled  the  available 
microarray  and  animal  model  experiments  for  B.  mallei  and 
the  closely  related  bacteria  B.  pseudomallei ,  B.  cenocepacia, 
and  B.  thailartdertsis  (13, 22,  48,  57-59).  We  identified  proteins 
that  were  overexpressed  during  infections  and  in  disease 
states  and,  hence,  considered  relevant  for  pathogenicity. 
Next,  we  used  Quartets  to  detect  orthologs  of  the  overex¬ 
pressed  proteins  that  are  present  in  the  merged  list  of  B. 
mallei  proteins.  This  procedure  provided  additional  evidence 
for  pathogenicity  among  of  our  identified  proteins.  Finally,  we 
individually  assessed  and  classified  each  of  the  proteins  and 
created  a  final  list  of  49  potential  virulence  factors  for  exper¬ 
imental  evaluation.  The  final  list  was  divided  into  four  groups 


based  on  the  assessed  reliability  of  evidences.  Table  III  shows 
the  49  proteins,  18  of  which  we  classified  into  the  most 
reliable  group  ( group  1).  All  identified  proteins  and  their  asso¬ 
ciated  evidence  for  involvement  in  pathogenicity  are  further 
detailed  in  supplemental  Table  S4. 

Y2H  Screening  Results  and  Data  Analysis— Out  of  the  49 
selected  proteins,  43  of  the  corresponding  genes  (or  gene 
fragments)  were  successfully  cloned,  prepared,  and  tested  in 
Y2FI  assays  against  both  human  and  murine  whole  proteome 
libraries.  The  raw  interaction  data  set  consisted  of  637  PPIs 
between  human  and  B.  mallei  proteins  and  900  PPIs  between 
murine  and  B.  mallei  proteins.  We  filtered  these  data  sets  by 
removing  protein  interactions  between  B.  mallei  and  “sticky” 
host  proteins  known  to  be  indiscriminate  binders  (F.  Schwarz 
and  M.  Koegl,  unpublished  data).  The  filtered  data  consisted 
of  600  unique  interactions  between  human  and  B.  mallei 
proteins  and  846  unique  interactions  between  murine  and  B. 
mallei  proteins  (supplementary  Data  set  SI  and  supplemen¬ 
tary  Data  set  S2).  Fig.  2  shows  the  host-pathogen  protein 
interactions  and  Table  III  lists  the  number  of  Y2H  interactions 
we  detected  for  each  tested  protein.  These  1446  PPIs  in¬ 
volved  26  unique  B.  mallei  proteins,  409  unique  human  pro¬ 
teins,  and  574  unique  murine  proteins.  Nineteen  of  the  26  B. 
mallei  proteins  interacted  with  proteins  of  both  hosts,  whereas 
two  B.  mallei  proteins  interacted  only  with  human  proteins 
and  five  B.  mallei  proteins  interacted  only  with  murine  pro¬ 
teins.  Furthermore,  we  found  33  conserved  PPIs  between 
these  two  data  sets,  i.e.  interactions  in  which  the  human 
proteins  interacted  with  the  same  B.  mallei  proteins  as  their 
murine  orthologs.  Finally,  we  found  that  16  human  proteins 
(9%)  and  21  murine  proteins  (11%)  interacted  with  multiple 
pathogen  proteins,  implying  that  these  proteins  may  be  in¬ 
volved  in  more  than  one  stage  of  the  infectious  process  and, 
potentially,  have  a  multivariate  pathogenic  functionality. 

Selection  of  Proteins  for  Animal-model  Experiments— To 
identify  the  likely  virulence  factors  for  validation  in  animal 
experiments,  we  focused  on  the  19  B.  mallei  proteins  that 
interacted  with  both  hosts.  We  considered  this  to  be  a  rea¬ 
sonable  down  selection  criterion  because  murine  animal 
models  of  S.  mallei  infection  share  many  pathophysiological 
characteristics  with  human  infections  (62,  63).  Thirteen  of 
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Table  III 

List  of  proteins  selected  for  experimental  yeast  two-hybrid  evaluation.  * Proteins  that  have  been  experimentally  linked  to  a  particular 
secretion  system.  Genes  associated  with  secretion  system  are  classified  according  to  their  secretion  system  type  and 

chromosomal  locations 


Group 

Locus  tag 

Protein  name 

Secretion 

system 

Successfully 

cloned 

Number  of  PPIs  detected 

Human 

Murine 

Conserved 

1 

BMAA041 0* 

VgrG 

6/c3a 

Yes 

1 

BMAA0445* 

VgrG 

6/c2a 

Yes 

2 

1 

7 

BMAA0446* 

VgrG 

6/c2a 

Yes 

7 

7 

1 

BMAA0737* 

VgrG 

6/c1a 

No 

7 

BMAA1901* 

VgrG 

6/c4a 

Yes 

7 

BMAA0749* 

BimA 

5b 

Yes 

80 

199 

5 

7 

BMA0278* 

PilA 

- 

Yes 

42 

64 

2 

7 

BMAA0079* 

PIcN 

2® 

Yes 

7 

BMAA1521* 

BopA 

3d 

Yes 

1 

1 

7 

BMAA1523* 

BopE 

3d 

Yes 

27 

7 

BMAA1525* 

BapB 

3d 

Yes 

9 

13 

7 

BMAA1528* 

BipD 

3d 

Yes 

41 

49 

3 

7 

BMAA1530* 

BipC 

3d 

Yes 

2 

7 

BMAA1531* 

BipB 

3d 

Yes 

24 

25 

1 

7 

BMA0855* 

VgrG  Rhs  domain 

6® 

No 

7 

BMAA1269 

VgrG 

6® 

Yes 

1 

1 

7 

BMAA1538* 

BsaU 

3d 

Yes 

14 

42 

1 

7 

BMAA0729* 

TssM 

6/c1a 

Yes 

2 

2 

BMA2097 

Hypothetical  protein 

3f 

Yes 

2 

BMAA1865 

Hypothetical  protein 

3f 

Yes 

61 

89 

6 

2 

BMAA1899 

Hypothetical  protein 

3f 

No 

2 

BMA0666 

CysD-1 

3f 

Yes 

3 

2 

BMAA1487 

Methyl-accepting  chemotaxis  protein 

3f 

No 

2 

BMA3281 

FliF 

3f 

Yes 

42 

43 

1 

2 

BMA1750 

HrpA 

3f 

Yes 

2 

BMAA0728 

TssN 

39 

Yes 

72 

87 

3 

2 

BMA2787 

ipgF 

3f 

Yes 

2 

BMA0267 

Pseudogene 

~ 

Yes 

10 

4 

1 

2 

BMAA0038 

Hypothetical  protein 

~ 

Yes 

2 

BMAA1662 

Response  regulator 

3f 

Yes 

19 

2 

BMA0429 

Cmk 

3f 

Yes 

63 

58 

4 

2 

BMA2469 

Tkt 

3 ' 

Yes 

24 

6 

2 

BMAA0238 

Hypothetical  protein 

2h 

Yes 

25 

28 

2 

2 

BMAA1173 

Hypothetical  protein 

~ 

Yes 

2 

BMA0886 

AcpA 

2  h 

No 

3 

BMAA1618 

Hypothetical  protein 

3f 

Yes 

3 

BMAA1619 

Hypothetical  protein 

3' 

Yes 

14 

9 

1 

3 

BMAA0679 

Chemotaxis  protein  CheC 

- 

Yes 

7 

3 

BMAA0553 

Ser/Thr  protein  phosphatase 

2h 

Yes 

59 

60 

3 

3 

BMAA1559 

Thermolysin  metallopeptidase 

2h 

Yes 

3 

BMAA0582 

Hypothetical  protein 

3f 

Yes 

4 

BMAA1566 

Serine-type  carboxypeptidase 

3f 

Yes 

4 

BMAA1641 

Hypothetical  protein 

3f 

Yes 

4 

BMAA0605 

Hypothetical  protein 

3f 

Yes 

4 

BMAA1111 

Hypothetical  protein 

3f 

No 

4 

BMAA1648 

Hypothetical  protein 

3f 

Yes 

9 

4 

BMAA1997 

Phosphatidylserine  decarboxylase 

3f 

Yes 

4 

BMAA2052 

Polysaccharide  deacetylase 

3f 

Yes 

4 

BMAA041 8 

NAD-dependent  deacetylase 

3f 

Yes 

a  Gene  located  within  the  loci  of  a  type  6  secretion  system  (T6SS)  clusters  (cluster  1  [cl]:  BMAA0744-0729,  cluster  2  [c2]:  BMAA0438-0455, 
cluster  3  [c3]:  BMAA0396-0412,  and  cluster  4  [c4]:  BMAA1897-1915)  (13). 
b  Type  5  secretion  system  effector  (auto-transporter),  located  adjacent  to  T6SS  cl . 

®  Predicted  phospholipase,  a  putative  type  2  secretion  system  (T2SS)  effector  (60). 

d  Gene  located  within  the  locus  of  the  type  3  secretion  system  (T3SS)  (animal  pathogen):  BMAA1 520-1 552  (14). 
e  T6SS-associated  effector  not  located  in  any  known  or  annotated  T6SS  cluster. 

'  T3SS-predicted  effector  not  located  in  any  known  or  annotated  T3SS. 

9  Predicted  as  a  T3SS  effector,  but  located  adjacent  to  T6SS  cl . 

h  Predicted  to  be  secreted  by  the  general  Sec  pathway  and  therefore  possibly  related  to  the  type  2  secretion  system. 

'  Protein  located  within  the  locus  of  the  putative  plant-pathogenicity-associated  T3SS:  BMAA1617-1640  (61). 
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BMAA1530  BMAA0445 


ZNF800 


PSMD8 


HSPA5 


NPC2 


IMAA0679 


BMAA1525 


ETFA  BPGM 


A0553 


BMAAlSSl  PSjMA6 


TRAPR4 


BMAA1269I 


BMAA1521 


BMA0666 


BMAA0445 


BMAAK 


BMAA1662 


BMAA1619 


Psmd8 


BMA2469 


>T\nll 


BMAA152: 


BMAA1269 


Ube2v2 


BMAA0729 


BMAA1521 


(3  B.  mallei  proteins 

^  Human-murine  orthologous  proteins 

O  Proteins  with  no  identified  orthologs  in  the  other  network 


—  Host-/?,  mallei  PPls 

—  Host-/?,  mallei  PPIs  conserved  across  species 


Fig.  2.  Y2H  host-pathogen  protein-protein  interactions.  Our  yeast  two-hybrid  (Y2H)  assays  against  whole  human  and  murine  proteome 
libraries  identified  600  protein-protein  interactions  (PPIs)  between  21  B.  mallei  proteins  and  409  human  proteins  (A)  and  846  PPIs  between  24 
B.  mallei  proteins  and  574  murine  proteins  (£?).  Green  nodes  represent  B.  mallei  proteins,  whereas  purple  and  red  nodes  represent  host 
proteins.  There  are  33  conserved  interactions  between  these  two  data  sets  (red  edges),  i.e.  in  33  PPIs,  human  proteins  (red  nodes  in  A) 
interacted  with  the  same  B.  mallei  proteins  as  their  murine  orthologs  (red  nodes  in  S). 


these  19  proteins  participated  in  interactions  that  were  con¬ 
served  between  the  two  different  hosts,  and  eight  of  them 
participated  in  more  than  one  conserved  PPI.  Three  of  these 
proteins,  PilA,  BipD,  and  BimA,  have  been  previously  charac¬ 
terized  as  virulence  factors  in  animal  models  (64-68).  The 
other  five  proteins  represent  targets  that  have  not  been  pre¬ 
viously  studied  in  animal  models  of  glanders  or  melioidosis,  and 
we  considered  them  as  promising  proteins  for  animal  model 
experiments.  Because  one  of  these  five  proteins  consists  of 
only  62  amino  acids,  it  was  too  small  for  the  creation  of  insertion 
mutants  and  was  removed  from  further  consideration.  Another 
protein,  cytidylate  kinase  BMA0429,  was  identified  as  an  essen¬ 
tial  B.  mallei  protein  [it  is  an  ortholog  of  STY0980,  an  essential 
gene  of  Salmonella  enterica  Typhimurium  (69)],  and,  as  such, 
was  not  likely  to  be  successfully  mutated  and  was  not  consid¬ 
ered  further.  Thus,  we  identified  BMAA1865,  BMAA0728,  and 
BMAA0553  as  three  relatively  uncharacterized  and  novel  B. 
mallei  candidate  virulence  factors. 

We  initially  examined  PPIs  among  these  three  pathogen 
proteins  and  host  proteins  to  identify  sets  of  overrepresented 
Gene  Ontology  (GO)  terms  and  Kyoto  Encyclopedia  of  Genes 
and  Genomes  (KEGG)  pathways  among  the  host  proteins.  We 
found  that  each  of  the  three  proteins  interacted  with  a  statis¬ 
tically  significant  (p  <  0.02)  number  of  host  proteins  that  are 


exclusively  part  of  pathways  related  to  ubiquitin  processing, 
receptor  signaling,  and  protein  processing  in  the  endoplasmic 
reticulum,  i.e.  they  interacted  with  host  proteins  that  are  pri¬ 
marily  part  of  these  pathways  and  do  not  appear  in  a  large 
number  of  other  pathways  (see  Experimental  Procedures).  In 
summary,  we  found  that  each  of  these  three  proteins  inter¬ 
acted  with  host  proteins  involved  in  biological  processes 
and/or  molecular  pathways  often  implicated  in  pathogenicity 
and  were,  therefore,  of  interest  for  experimental  validation  in  a 
murine  model  of  glanders. 

Mutants  Showed  Attenuated  Virulence  in  a  BALB/c  Aerosol 
Challenge  Model—  Based  on  the  above  evidence,  we  created 
separate  insertion  mutants  in  the  B.  mallei  ATCC  23344  strain 
for  each  of  the  three  genes  (BMAA0728,  BMAA1865,  and 
BMAA0553).  These  mutants  did  not  seem  to  have  any  ascer¬ 
tainable  in  vitro  growth  defect  and  appeared  fully  competent. 
We  subsequently  used  a  mouse  aerosol  challenge  model  to 
assess  potential  attenuation  afforded  by  these  B.  mallei  mu¬ 
tants.  The  fully  virulent  wild-type  B.  mallei  ATCC  23344  strain 
was  used  as  a  positive  control. 

Forty  BALB/c  mice  (10  mice  for  each  of  the  three  mutant 
strains  and  10  mice  for  the  wild-type  strain)  were  exposed  to 
inhalation  doses  of  >104  colony-forming  units  (cfu;  corre¬ 
sponding  to  >10  LD50  for  the  wild-type  strain)  and  monitored 
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Fig.  3.  Mouse  aerosol  challenge  model  results.  In  the  first  set  of  experiments,  40  BALB/c  mice  (10  mice  for  each  of  the  three  B.  mallei 
mutants  +  10  mice  exposed  to  the  wild-type  strain)  received  inhalation  doses  of  >2  X  104  colony-forming  units  (cfu),  corresponding  to  >10 
LD50  for  the  wild-type  strain.  In  the  second  set  of  experiments,  additional  40  BALB/c  mice  received  inhalation  doses  of  >2  x  105  cfu, 
corresponding  to  >20  LD50  for  the  wild-type  strain.  In  both  experiments,  animals  were  monitored  for  21  days.  At  the  end  of  the  first  set  of 
experiments,  all  30  mice  exposed  to  mutant  strains  survived  21  days  post-exposure  (pink,  orange,  and  blue  lines),  whereas  seven  mice 
exposed  to  the  wild-type  strain  did  not  survive  (green  line).  At  the  end  of  the  second  set  of  experiments,  all  30  mice  exposed  to  mutant  strains 
also  survived  21  days  post-exposure  (pink,  orange,  and  blue  lines),  whereas  all  10  mice  infected  with  B.  mallei  ATCC  23344  died  (purple  line). 
There  was  a  statistically  significant  difference  in  the  survival  rate  of  mice  exposed  to  mutant  strains  and  mice  exposed  to  the  wild-type  strain 
in  both  sets  of  experiments  (p  =  4  x  1CT3  and  p  =  9  x  1CT5  for  the  first  and  second  set  of  experiments,  respectively).  These  results  support 
the  notion  that  each  of  the  three  mutants  attenuates  virulence  when  infected  in  mice  via  the  inhalational  route  of  infection. 


for  21  days.  Fig.  3  shows  the  survival  curves  of  mice  exposed 
to  mutant  strains  and  mice  exposed  to  the  wild-type  strain. 
For  this  dose,  each  of  the  three  mutants  appeared  to  attenu¬ 
ate  infection,  as  all  30  mice  exposed  to  the  mutants  survived 
21  days  post-exposure,  whereas  seven  mice  exposed  to  the 
wild-type  strain  did  not  survive.  A  survival-curve  analysis  in¬ 
dicated  that  this  difference  was  statistically  significant  (p  = 
4  x  1CT3).  However,  because  not  all  animals  exposed  to  the 
wild-type  challenge  died  at  the  considered  dose  (>104  cfu), 
we  repeated  the  experiment  using  a  higher  aerosol  challenge 
dose. 

Accordingly,  40  additional  BALB/c  mice  (3  x  10  +  10)  were 
exposed  to  inhalation  doses  of  >  2  x  105  cfu  (corresponding 
to  >20  LD50  for  the  wild-type  strain)  and  monitored  for  21 
days.  Similarly,  we  found  statistically  significant  differences 
(p  =  9  x  1 CT5)  in  the  survival  rate  of  mice  exposed  to  mutants 
and  mice  exposed  to  the  wild-type  strain.  All  three  of  the  B. 
mallei  mutants  appeared  to  attenuate  infection,  i.e.  all  mice 
infected  with  the  mutants  survived  21  days  post-exposure, 
whereas  all  of  the  mice  infected  with  B.  mallei  ATCC  23344 
strain  died  within  7  days  post  exposure  (Fig.  3).  These  results 
support  the  hypothesis  that  each  of  the  three  mutants  attains 
a  virulence  defect  when  infected  in  mice  via  the  inhalational 
route  of  infection. 

DISCUSSION 

In  vitro  high-throughput  technologies  have  become  versa¬ 
tile  tools  for  discovery  and  hypothesis  generation  involving 
proteins  and  protein  interactions  that  underlie  infectious 
pathogenesis  (70-72).  These  technologies  have  been  suc¬ 
cessfully  applied  to  both  viral  (73-78)  and  bacterial  (21-23, 
79-82)  pathogens  to  shed  light  on  how  these  organisms 
interact  with  and  subvert  host  defense  processes.  For  exam¬ 


ple,  Li  et  al.  used  a  small  set  of  conserved  herpesvirus- 
encoded  protein  kinases  to  identify  key  targeted  host  proteins 
using  human  protein  microarray  experiments  (77).  The  analy¬ 
sis  of  these  interactions  identified  the  DNA  damage  response 
pathway  and  other  key  host  regulators  as  central  targets  in 
sustaining  the  viral  life  cycle. 

In  Vitro  Approaches  to  Study  B.  mallei  Pathogenesis— A  key 
component  of  B.  mallei  pathogenesis  is  its  ability  to  enter, 
survive,  and  replicate  within  the  host  cells.  Secretion  systems 
play  an  important  role  in  pathogenicity  of  the  Gram-negative 
B.  mallei  by  allowing  for  controlled  transport  and  insertion  of 
pathogen  proteins  into  the  host  cell  cytosol.  The  identities  of 
secretion  systems’  proteins  are  not  all  cataloged,  and  the 
interactions  between  specific  B.  mallei  proteins  and  host  pro¬ 
teins  and  other  host  factors  are  not  well  characterized.  Given 
that  most  Burkholderia  spp.  are  opportunistic  infections,  one 
cannot  expect  all  secretion  system  proteins  to  have  evolved 
host-specific  protein  interactions.  Instead,  the  bulk  of  the 
interactions  must  encompass  generic  host  target  interactions 
to  broadly  cope  with  multiple  and  variable  hosts,  e.g.  plants, 
nematodes,  and  other  soil  microorganisms,  as  well  as  oppor¬ 
tunistic  infections  in  mammalian  hosts.  Even  for  the  equine- 
adapted  and,  thus,  more  genetically  constrained,  B.  mallei 
pathogen,  we  cannot  expect  that  either  human  or  murine 
interactions  will  have  high  specificity.  As  Y2H  techniques  are 
sensitive  to  weak  interactions,  this  method  is  ideally  suited  to 
probe  interactions  nonspecific  to  a  distinct  host  (83).  Although 
many  Burkholderia  proteins  could  potentially  interact  with 
multiple  host  proteins,  nonsecreted  proteins  are  less  likely  to 
encounter  host  proteins  and,  hence,  focusing  on  secretion 
systems’  proteins  will  increase  the  likelihood  of  finding  bio¬ 
logically  important  interactions  relevant  to  establishing  the 
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infection.  Our  approach  was  to  use  in  silico  methods  to  spe¬ 
cifically  search  for  virulence  factors  that  are  likely  to  be  a  part 
of  a  secretion  system  and  gauge  their  potential  relevance  by 
identifying  important  host-pathogen  protein  interactions 
based  on  in  vitro  data.  The  in  vitro  Y2H  results  thus  represent 
an  important  source  for  generating  hypothesis  on  B.  mallei 
virulence  and  pathogenicity. 

Y2H  Detection  of  Host  Interactions— Table  III  shows  the 
number  of  interactions  between  the  tested  B.  mallei  proteins 
and  human  and  murine  proteins  as  well  as  the  corresponding 
number  of  conserved  interactions  in  terms  of  orthologous 
pairs  of  interactions.  The  small  overlap  between  murine  and 
human  prey  proteins  for  which  PPIs  were  identified  is  poten¬ 
tially  an  artifact  from  the  Y2H  experiments.  A  comprehensive 
detection  of  all  possible  host-prey  and  pathogen-bait  protein 
interactions  requires  exhaustive  repetition  (84)  and,  hence, 
our  detected  PPIs  represent  a  subset  of  all  interactions  that 
can  occur.  These  results  suggest  that  more  comprehensive 
screens  may  be  required  to  exhaustively  delineate  the  total 
set  of  possible  interactions  for  the  two  hosts. 

The  largest  numbers  of  interactions  were  detected  for  pro¬ 
teins  in  groups  1  and  2,  i.e.  for  proteins  with  multiple  evi¬ 
dences,  whereas  the  number  of  detected  interactions  de¬ 
creased  with  a  decreased  number  of  evidences  and  an 
increased  fraction  of  hypothetical  proteins  within  a  group. 
Thus,  the  proteins  in  group  4,  derived  from  ab  initio  prediction 
methods  and  annotated  as  enzymes  or  hypothetical  proteins, 
did  not  notably  interact  with  host  proteins  in  our  Y2H  screens. 
However,  a  lack  of  host  interactions  does  not  necessarily 
indicate  a  lack  of  a  potential  biological  role,  as  proteins  may 
function  as  modulators  of  metabolites  or  other  host  factors 
without  involving  detectable  protein  interactions. 

Known  Virulence  Factors  and  Their  Virulence  Phenotypes— 
Among  the  known  virulence  factors  shown  in  Table  III,  those 
linked  to  the  T3SS  in  group  1  interacted  with  at  least  one  host 
protein.  Some  of  these  proteins,  e.g.  BipD,  BipB,  BopA,  and 
BopE,  have  already  been  shown  to  be  required  for  different 
aspects  of  pathogenicity  in  different  animal  models.  For  ex¬ 
ample,  the  BipD  mutant  of  B.  pseudomallei  has  been  shown 
to  attenuate  infection  after  intraperitoneal  challenge  of 
BALB/c  mice  (67),  whereas  the  BipB  mutant  of  B.  pseudomal¬ 
lei  was  not  able  to  produce  multinucleated  giant  cells  during 
intranasal  challenge  of  the  same  host  and,  consequently, 
improved  host  survival  time  (19).  These  two  proteins,  along 
with  BipC,  are  positioned  at  the  tip  of  the  T3SS  needle  and 
form  translocation  pores  (56).  Furthermore,  studies  have 
shown  that  the  effector  (secreted  protein)  BopA  is  required  for 
intracellular  survival  of  both  B.  pseudomallei  and  B.  mallei  in 
phagocytic  cells  (56)  and  the  B.  pseudomallei  effector  BopE  is 
required  for  optimal  invasion  of  nonphagocytic  cells.  How¬ 
ever,  because  BopE  deletion  mutants  showed  hardly  any 
attenuation  during  intraperitoneal  or  intranasal  infection  of  B. 
pseudomallei  in  BALB/c  mice  (56,  67),  these  results  indicate 


that  multiple  effectors,  possibly  in  concert  with  BopE,  con¬ 
tribute  to  a  robust  infection. 

Group  1  also  contained  a  number  of  proteins  from  T6SS, 
the  majority  of  which  were  annotated  as  proteins  containing  a 
VgrG  domain.  A  number  of  large  VgrGs  (complex  trimeric 
proteins  containing  transmembrane  parts  and  a  C-terminal 
domain)  has  been  identified  in  Gram-negative  bacteria  and 
shown  to  be  secreted  and  interact  with  host  cells  (85).  The 
largest  VgrG  protein  in  B.  mallei,  BMAA0737,  is  an  ortholog  of 
the  B.  pseudomallei  VgrG  protein,  BPSS1 503,  a  protein  iden¬ 
tified  as  essential  for  virulence  (11).  Although  we  were  unable 
to  successfully  clone  BMAA0737,  we  identified  a  handful  of 
PPIs  for  three  of  the  five  VgrG  protein  constructs  we  could 
assay.  Two  of  them,  BMAA0445  and  BMAA0446,  belonging  to 
the  second  T6SS  cluster,  were  recently  identified  in  B.  pseu¬ 
domallei  as  important  for  interactions  with  other  bacteria  (86), 
whereas  the  third  one,  BMAA1 269,  was  located  outside  of  any 
known  T6SS  clusters  (1 3).  The  small  number  of  identified  PPIs 
for  VgrG  proteins  is  likely  a  consequence  of  difficulties  in 
expressing  large  proteins  with  transmembrane  domains  (87). 
Preliminary  data  also  suggest  that  recombinant  expression  of 
VgrG  proteins  in  Escherichia  coli  is  challenging  (K.  Kwon, 
unpublished  data). 

In  addition  to  T3SS  and  T6SS  proteins,  group  1  also  con¬ 
tained  BimA,  a  well-characterized  auto-transporter  of  B. 
pseudomallei  and  B.  mallei  that  is  exported  through  a  type  5 
secretion  system  (12).  The  role  of  BimA  in  actin-based  motility 
is  accepted,  but  the  mechanisms  are  not  fully  understood, 
and  it  is  likely  that  they  differ  between  B.  pseudomallei  and  B. 
mallei  (12).  In  B.  mallei,  BimA  was  not  required  for  pathoge¬ 
nicity  in  a  hamster  model  of  glanders  (13),  whereas  in  B. 
pseudomallei  it  was  found  to  contribute  to  pathogenicity  in 
murine  models  (12). 

Thus,  although  many  of  the  known  virulence  factors  in 
group  1  did  include  a  large  number  of  interactions,  their 
specific  biological  function  and  resultant  infectious  phenotype 
varied  dependent  on  both  the  animal  model  and  the  pathogen 
species  used.  Given  that  the  three  B.  mallei  proteins, 
BMAA0728,  BMAA1865,  and  BMAA0553,  all  similarly  attenu¬ 
ated  virulence  in  our  animal  model  and,  hence,  generated  a 
common  nonvirulent  infectious  phenotype,  we  now  turn  to  our 
Y2H  interaction  data  to  infer  and  discuss  the  putative  role  of 
these  novel  virulence  factors  in  invasion  and  survival  within 
the  host  cell. 

Novel  Virulence  Factors  and  Their  Host  Protein  Interac¬ 
tions— Focus'mg  on  the  three  novel  B.  mallei  virulence  factors 
and  their  interacting  human  and  murine  proteins  (Table  III), 
these  proteins  participated  in  192  and  236  PPIs  with  human 
and  murine  proteins,  respectively  (supplementary  Data  set  S3 
and  supplementary  Data  set  S4),  containing  170  unique  hu¬ 
man  proteins  and  191  unique  murine  proteins.  Fig.  4  graphi¬ 
cally  shows  these  protein  interactions  for  both  hosts.  There 
are  12  conserved  (or  overlapping)  PPIs  between  these  two 
data  sets,  containing  10  unique  host  proteins.  To  maximize 
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Q  B.  mallei  proteins 

^  Human-murine  orthologous  proteins 

Q  Proteins  with  no  identified 
orthologs  in  the  other  network 


—  Host-  B.  mallei  PPIs 

—  Host-  B.  mallei  PPIs  conserved 
across  species 


Fig.  4.  Host-pathogen  interactions  for  three  B.  mallei  virulence  factors.  Three  B.  mallei  virulence  factors  (green  nodes)  interact  with  1 92 
human  (A)  and  236  murine  (S)  proteins  (purple  and  red  nodes).  There  are  12  conserved  protein-protein  interactions  (PPIs)  between  these  two 
PPI  data  sets  (red  edges),  i.e.  in  12  PPIs,  human  proteins  (red  nodes  in  A)  interacted  with  the  same  B.  mallei  proteins  as  their  murine  orthologs 
(red  nodes  in  B).  The  host  proteins  (human/murine)  in  the  conserved  interactions  include:  activating  signal  cointegrator  1  complex  subunit  3 
(ASCC3/Ascc3),  epididymal  secretory  protein  El  (NPC2/Npc2),  sperm-associated  antigen  17  (SPAG17/Spag17),  ARCN1  protein-coatomer 
subunit-S  (ARCNI/Arcnl),  UMP-cytidine  monophosphate  kinase  (CMPKI/Cmpkl),  heat  shock  70-kDa  protein  5  (HSPA5/Hspa5),  bisphos- 
phoglycerate  mutase  (BPGM/Bpgm),  DnaJ  homolog  subfamily  B,  member  4  (DNAJB4/Dnajb4),  syntaxin-8  (STX8/Stx8),  and  thioredoxin-like 
protein  1  (TXNLI/Txnll). 


the  coverage  of  interactions,  we  combined  all  human-  and 
murine-S.  mallei  PPIs  detected  in  our  screens  in  subsequent 
analysis.  This  resulted  in  a  data  set  consisting  of  382  interac¬ 
tions  among  three  B.  mallei  proteins  and  336  human  proteins 
(170  human  proteins  +  166  human-murine  orthologs). 

Pathogen  Targeting  Ubiquitin-related  Processes— We  found 
that  all  three  proteins  interacted  with  a  statistically  significant 
[p  <  2  x  1CT3)  number  of  host  proteins  involved  in  ubiquitin- 
related  processes  (supplemental  Table  S5).  This  finding  is  note¬ 
worthy,  as  intracellular  pathogens  are  known  to  interfere  with 
host  ubiquitination  processes,  thus  disturbing  immune  signaling 
cascades  and  attenuating  the  immune  response  (88-91).  Addi¬ 
tionally,  pathogens  interfere  with  the  host  ubiquitin  protea- 
some  system  to  prevent  their  degradation  as  well  as  to  ensure 
their  destruction  when  no  longer  required  for  establishing  the 
infection  (88-91). 

One  of  the  major  known  B.  mallei  virulence  factors  in  the 
modulation  of  host  ubiquitination  system  is  TssM 
(BMAA0729),  a  protein  that  encodes  a  broadly  acting  deu- 
biquitinase  (17,  92).  Although  the  exact  function  of  TssM  is 
unknown,  it  has  been  suggested  that  it  acts  to  remove  ubiq¬ 


uitin  from  pathogen  proteins,  preventing  their  recognition  and, 
consequently,  the  activation  of  early  inflammatory  responses 
(64).  Using  the  PFAM-A  database  (43),  we  mapped  the  ubiq¬ 
uitin  C-terminal  hydrolase  (UCH)  domain  to  TssM.  Interest¬ 
ingly,  we  found  that  the  identified  BMAA0728  protein  (TssN) 
interacted  with  human  and  murine  proteins  that  contain  the 
UCH  domain.  This  implies  that  it  is  possible  that  BMAA0728 
also  interacts  with  the  UCH  domain  of  its  chromosome-adja¬ 
cent  protein  TssM,  a  protein  with  which  it  is  also  coexpressed 
(D.  DeShazer,  unpublished  data).  Even  though  we  tried  to 
identify  PPIs  for  TssM  (Table  III),  we  were  unable  to  obtain  a 
significant  number  of  interactions  for  this  protein,  despite  the 
large  number  of  interactions  detected  for  TssN.  We  further 
found  that  BMAA0728  interacts  with  the  polyubiquitin-B  pro¬ 
tein  (UBB)  and  with  the  cullin-la  protein  (CUL1),  a  core  com¬ 
ponent  of  multiple  cullin-RING-based  SCF  E3  ubiquitin-pro- 
tein  ligase  complexes.  From  the  available  human  PPI  data 
(42),  we  found  that  UBB  and  CUL1  physically  interact  with 
TNF  receptor-associated  factor  6  (TRAF-6)  and  l-kappa-B 
inhibitor  alpha  (IkB-a),  two  host  proteins  that  have  been 
shown  to  be  targets  of  TssM  (1 7,  92).  These  observations  lead 
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to  the  hypothesis  that  BMAA0728  either  interferes  with  host 
ubiquitination  processes  directly  or  in  collaboration  with 
TssM.  However,  based  on  currently  available  domain  anno¬ 
tations,  no  known  domain  could  be  assigned  to  BMAA0728, 

Pathogen  Targeting  Host  Entry  Mechanisms— We  identified 
Y2H  interactions  between  BMAA0728  and  GABA(A)  receptor- 
associated  protein-like  1  (GABARAPL1),  a  host  protein  pos¬ 
sibly  involved  in  the  closure  of  phagosomal  membranes  (93, 
94).  This  observation  implies  a  possible  role  of  BMAA0728  in  B. 
mallei  (auto)phagosomal  escape.  Besides  GABARAPL1,  we 
found  many  other  murine  and  human  host  proteins  that  are 
involved  in  processes  that  can  be  related  to  cellular  internaliza¬ 
tion  or  degradation  escape  and  evasion  and  that  interact  with  all 
three  proteins  (BMAA0728,  BMAA1865,  and  BMAA0553;  sup¬ 
plemental  Table  S5).  However,  based  on  the  available  data,  we 
cannot  determine  a  precise  role  for  these  three  proteins  in 
(auto)phagosomal  escape/evasion  processes. 

To  gain  entry  into  host  cells,  multiple  pathogen  proteins 
target  the  host  cytoskeleton  through  direct  interactions  with 
actin  cytoskeleton  regulators  (95,  96).  One  such  protein,  Rho 
family  GTPase  (guanosine  triphosphatase)  CDC42,  is  a  known 
pathogen  target  that  induces  actin  rearrangement  in  the  host 
cell  (64,  96-98).  Previous  studies  have  suggested  that  B. 
pseudomallei  BopE  (B.  mallei  orthologous  protein  BopE)  acts 
as  a  guanine  nucleotide  exchange  factor  for  the  Rho  family  of 
GTPases  and  that,  through  an  interaction  with  a  Rho  GTPase 
CDC42,  B.  pseudomallei  facilitates  membrane  ruffling  and 
uptake  into  the  host  cell  (64,  99).  Studies  have  also  implied 
that  multiple  pathogen  proteins  may  act  together  with  BopE  to 
facilitate  bacterial  invasion  (64,  99).  Our  study  identified  that 
BMAA1 865  also  interacts  with  RhoGTPases,  such  as  CDC42 
and  RhoH  (Ras  homolog  gene  family,  member  H),  as  well  as 
with  a  number  of  other  proteins  involved  in  processes  related 
to  actin  cytoskeleton  rearrangement  (supplemental  Table  S5), 
and  we  hypothesize  that  BMAA1865  represents  another  B. 
mallei  protein  involved  in  host  membrane  ruffling  and  bacterial 
invasion. 

Pathogen  Targeting  Phosphorylation  Signaling— Although 
the  B.  mallei  serine/threonine  phosphatase  BMAA0553  did 
not  interact  with  any  host  protein  already  known  to  be  in¬ 
volved  in  bacterial  invasion  and  survival,  we  found  it  inter¬ 
acted  with  a  large  number  of  host  proteins  involved  in  various 
signaling  processes  and,  hence,  it  is  likely  that  it  interferes 
with  host  intracellular  signaling  pathways  (supplemental  Table 
S5).  We  found  that  BMAA0553  interacts  with  Rho  GDP  (Rho 
family  of  guanosine  diphosphate)  dissociation  inhibitor-/3  (AR- 
HGDIB),  which  regulates  the  GDP/GTP  exchange  reaction  of 
Rho  proteins,  as  well  as  with  CDC42.  Furthermore,  as  serine/ 
threonine  phosphorylation  has  been  linked  to  actin  cytoskel¬ 
eton  rearrangement  processes  (97,  100),  it  is  possible  that 
BMAA0553  also  has  a  role  in  cytoskeletal  rearrangement. 

Understanding  Host-Pathogen  Interactions— Given  the  in¬ 
herent  nonspecific  protein  interactions  and  the  lack  of  ex¬ 
haustive  experimental  testing  of  protein  interactions,  our  ap¬ 


proach  was  to  delineate  the  possible  effects  of  select  B.  mallei 
protein  by  investigating  which  host  processes  they  would 
have  a  high  likelihood  to  interfere  with.  Indeed,  the  new  viru¬ 
lence  factors  we  identified  appear  to  have  a  multifunctional 
biological  role,  perhaps  because  of  nonspecific  interactions. 

The  nonspecific  nature  of  the  interactions  is  both  a  source 
of  weakness  and  strength  in  the  ability  of  the  pathogen  to 
establish  an  infection.  Avoiding  highly  specific  and  strong 
interactions  avoids  the  risk  of  being  limited  to  infecting  only 
those  hosts  that  have  the  requisite  specific  proteins.  How¬ 
ever,  it  requires  that  the  pathogen  maintain  many,  possibly 
alternative,  indiscriminate  interactions  to  achieve  the  same 
goal.  The  apparent  multivariate  capability  of  the  virulence 
factors  is  also  commensurate  with  the  notion  of  using  multi¬ 
ple,  possibly  overlapping,  interactions  to  influence  the  host 
environment  and  to  maximize  the  likelihood  of  establishing 
successful  infections  in  multiple  hosts  and  host  environments. 

CONCLUSIONS 

Our  study  is  the  first  study  to  specifically  and  systematically 
generate  host-protein  interactions  associated  with  known  and 
putative  virulence  factors  of  B.  mallei.  We  used  bioinformatics 
approaches  to  identify  49  proteins  involved  in  B.  mallei  path¬ 
ogenicity.  UsingY2H  assays  to  screen  these  proteins  against 
normalized  whole  human  and  murine  proteome  libraries,  we 
identified  a  large  number  of  host-pathogen  PPIs.  Analyzing 
these  interactions  allowed  us  to  characterize  host  proteins 
targeted  by  B.  mallei  proteins  and  to  identify  three  novel 
virulence  factor  candidates.  Using  a  mouse  aerosol  challenge 
model,  we  demonstrated  that  these  three  proteins  attenuated 
B.  mallei  ATCC  23344  virulence,  implicating  them  and  their 
host  targets  in  B.  mallei  pathogenicity. 
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